Efficiently Mining Frequent Closed Partial Orders ( Extended Abstract )

نویسندگان

  • Jian Pei
  • Jian Liu
  • Haixun Wang
  • Ke Wang
  • Philip S. Yu
  • Jianyong Wang
چکیده

Example 1 (Motivation) Suppose MapleBank in Canada wants to investigate whether there is some orders which customers often follow to open their accounts. A database DB in Table 1 about four customers’ sequences of opening accounts in MapleBank is analyzed. Given a support threshold min sup, a sequential pattern is a sequence s which appears as subsequences of at least min sup sequences. For example, let min sup = 3. The following four sequences are sequential patterns since they are subsequences of three sequences, 1, 2 and 4, in DB. CHK → MMK → MORT → RESP; CHK → MMK → MORT → BROK; CHK → RRSP → MORT → RESP; CHK → RRSP → MORT → BROK The sequential patterns capture the frequent account opening patterns shared by customers. However, the four sequential patterns cannot completely capture the ordering shared by customers 1, 2 and 4. It is easy to see that a partial order R as shown in Figure 1 is shared by the three account opening sequences. The partial order R summarizes the four sequential patterns – the four sequential patterns are paths in partial order R. It also provides more information about the ordering than the sequential patterns.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On compressing frequent patterns q

A major challenge in frequent-pattern mining is the sheer size of its mining results. To compress the frequent patterns, we propose to cluster frequent patterns with a tightness measure d (called d-cluster), and select a representative pattern for each cluster. The problem of finding a minimum set of representative patterns is shown NP-Hard. We develop two greedy methods, RPglobal and RPlocal. ...

متن کامل

Mining Top-K Frequent Closed Patterns without Minimum Support

In this paper, we propose a new mining task: mining top-k frequent closed patterns of length no less than min `, where k is the desired number of frequent closed patterns to be mined, and min ` is the minimal length of each pattern. An efficient algorithm, called TFP, is developed for mining such patterns without minimum support. Two methods, closed node count and descendant sum are proposed to...

متن کامل

Using attribute value lattice to find closed frequent itemsets

Finding all closed frequent itemsets is a key step of association rule mining since the non-redundant association rule can be inferred from all the closed frequent itemsets. In this paper we present a new method for finding closed frequent itemsets based on attribute value lattice. In the new method, we argue that vertical data representation and attribute value lattice can find all closed freq...

متن کامل

Simultaneous mining of frequent closed itemsets and their generators: Foundation and algorithm

Closed itemsets and their generators play an important role in frequent itemset and association rule mining. They allow a lossless representation of all frequent itemsets and association rules and facilitate mining. Some recent approaches discover frequent closed itemsets and generators separately. The Close algorithm mines them simultaneously but it needs to scan the database many times. Based...

متن کامل

CLAIM: An Efficient Method for Relaxed Frequent Closed Itemsets Mining over Stream Data

Recently, frequent itemsets mining over data streams attracted much attention. However, mining closed itemsets from data stream has not been well addressed. The main difficulty lies in its high complexity of maintenance aroused by the exact model definition of closed itemsets and the dynamic changing of data streams. In data stream scenario, it is sufficient to mining only approximated frequent...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005